Bimodal coherence based scale ambiguity cancellation for target speech extraction and enhancement
نویسندگان
چکیده
We present a novel method for extracting target speech from auditory mixtures using bimodal coherence, which is statistically characterised by a Gaussian mixture modal (GMM) in the offline training process, using the robust features obtained from the audio-visual speech. We then adjust the ICA-separated spectral components using the bimodal coherence in the time-frequency domain, to mitigate the scale ambiguities in different frequency bins. We tested our algorithm on the XM2VTS database, and the results show the performance improvement with our proposed algorithm in terms of SIR measurements.
منابع مشابه
A New Method for Speech Enhancement Based on Incoherent Model Learning in Wavelet Transform Domain
Quality of speech signal significantly reduces in the presence of environmental noise signals and leads to the imperfect performance of hearing aid devices, automatic speech recognition systems, and mobile phones. In this paper, the single channel speech enhancement of the corrupted signals by the additive noise signals is considered. A dictionary-based algorithm is proposed to train the speech...
متن کاملThe Relationship between Iranian EFL Learners’ Ambiguity Tolerance and the Accuracy of Their Task-based Oral Speech
Various individual differences, including ambiguity tolerance (AT), have gained momentum because of the influence they can exert on the process and product of learning, and thereby, on various aspects of the learner’s interlanguage system such as accuracy of oral speech. The present study was undertaken to examine the extent to which Iranian EFL learners’ AT was significantly correlated with th...
متن کاملSector-Based Detection for Hands-Free Speech Enhancement in Cars
Adaptation control of beamforming interference cancellation techniques is investigated for in-car speech acquisition. Two efficient adaptation control methods are proposed that avoid target cancellation. The “implicit” method varies the step-size continuously, based on the filtered output signal. The “explicit” method decides in a binary manner whether to adapt or not, based on a novel estimate...
متن کاملSingle-Microphone Speech Enhancement Inspired by Auditory System
Title of dissertation: Single-Microphone Speech Enhancement Inspired by Auditory System Majid Mirbagheri, Doctor of Philosophy, 2014 Dissertation directed by: Professor Shihab Shamma, Department of Electrical and Computer Enhancing quality of speech in noisy environments has been an active area of research due to the abundance of applications dealing with human voice and dependence of their per...
متن کاملA new metric for selecting sub-band processing in adaptive speech enhancement systems
A multi-microphone adaptive speech enhancement system employing diverse sub-band processing is presented. A new robust metric is developed, which is capable of real-time implementation, in order to automatically select the best form of processing within each sub-band. It is based on an adaptively estimated inter-channel Magnitude Squared Coherence (MSC) relationship, which is used to detect the...
متن کامل